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USING DATA OBTAINED BY THE QUALITY MEASUREMENT PROJECT 
ON SOME 70 1 000 PUPILS i THIS STUDY ATTEMPTED TO PRODUCE 
EVIDENCE REGARDING THE INFLUENCE OF CLASS SIZE AND CLASS 
HOMOGENEITY ON ACHIEVEMENT GAINS IN GRADES 7 AND 8. A TOTAL 
OF 130 ENGLISH AND 135 MATHEMATICS CLASSES CLASSIFIED 
ACCORDING TO SIZE AND HOMOGENEITY WERE EXAMINED. THE READING 
COMPREHENSION AND ARITHMETIC TEST SCORES ON THE IOWA TEST OF 
BASIC SKILLS WERE USED AS THE MEASURES OF ACHIEVEMENT. 

RESULTS INDICATED THAT GAIN DIFFERENCES IN RESPECT TO CLASS 
SIZE AND CLASS VARIABILITY WERE GENERALLY VERY SMALL AND 
INCONSISTENT. BECAUSE TWO-THIRDS OF THE CLASSES STUDIED 
CONSISTED OF FROM 23 TO 32 PUPILS i THE LARGEST AND SMALLEST 
CLASSES (LARGER THAN 34 AND SMALLER THAN 24 STUDENTS) WERE 
ISOLATED FOR SEPARATE COMPARISON. RESULTS CONFIRMED THAT 
THERE WAS NO INSTANCE OF A SIGNIFICANT DIFFERENCE IN 
ACHIEVEMENT GAIN EVEN BETWEEN THESE EXTREME GROUPS. ALTHOUGH 
THESE TESTS DO NOT MEASURE ALL TYPES OF ACHIEVEMENT i THEY DO 
SUGGEST THAT ATTENTION MIGHT MORE PROFITABLY BE DIRECTED 
TOWARD REDUCING THE NUMBER OF CLASSES ASSIGNED TO ONE TEACHER 
THAN TOWARD REDUCING THE SIZE OF THE CLASSES THEMSELVES. 

(THIS ARTICLE APPEARED IN *THE SCHOOL REVIEWi" VOL. 75 i NO. 

3 1 AUTUMN 1967.) (DL) 
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Class Size and Achievement Gains 
in Seventh- and Eighth-Grade 
English and Mathematics 



“For virtually the entire history of modem secondaiy education in 
this Nation . . . a report of the U.S. Office of Education notes, 
“there has been endless discussion over the proper number of pupils 
to be assigned a given secondary school class.”^ The common as- 
sumption has been that students achieve better in small classes. This 
study examines some evidence bearing on that assumption. 

Current proposals by Trump and Baynham^ and others tend to 
change the nature of the problem. Schools are urged to make flexible 
use of classes of various sizes for different kinds of learning activity. 
The effects of such variation on achievement will merit study, but 
the data presented here do not concern this practice. They deal 
with the conventional, and still most prevalent, arrangement where- 
by all classes are somewhat similar in size, are taught by one teacher, 
and do not vary in composition from day to day. 

Obviously, many factors determine the sizes of instmctional 
groups in schools. The nature of the subject, the ability levels of 
groups, the capacities of available classrooms, and relative costs 
must all be taken into considerationi In terms of the additional 
teachers and classrooms that would be needed, a reduction of aver- 
age class size by even one pupil throughout the nation would be 
enormously expensive. There may be many values of smaller classes 
which would amply justify the increased costs involved. But to the 
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extent that better academic achievement is one of the expected 
benefits, it is important to know what relation, if any, there is be- 
tween class size and achievement. 

Average class size in junior high schools has been reported® to be 
about 31. An oft-recommended goal, which accords with teachers' 
preferences,^ is a class size of 25. Thus, it is within this narrow 
range of 25-30 that the controversy centers. The average size of 
classes considered in the study reported here was between 28 and 
23. 

In 1964, Millman and Johnson® examined the achievement gains 
of a large number of instructional groups in terms of their homoge- 
neity with respect to initial achievement level. Further analysis of 
the same data now furnishes some evidence regarding the influence 
of class size on achievement gains. 

PROCEDUBE 

Data obtained by the Quality Measurement Project and made 
available by the New York State Education Department include the 
scores of large numbers of seventh- and eighth-grade students tested 
successively in 1957-58 and 1958-59 with the Iowa Basic Skills bat- 
teiy. Some 70,000 pupils in approximately a hundred representative 
school systems were tested. Class lists were obtained for thirty of these 
systems. Out of these were selected only those classes which were 
tested approximately a year apart and sufficiently early or late in the 
year so that the gain could appropriately be identified with either 
the seventh or eighth grade; only those classes in which the pupils 
still available for the second testing had mean initial scores within 
three-tenths of a grade of the mean initial score of the total class; 
and only those classes die true size of which was definitely known. 
A total of 130 English classes and 135 mathematics classes were 
available for study, 103 in grade 7 and 162 in grade 8. In many in- 
stances, as a result of conventional scheduling practices, pupils corn- 
prising an English class were the same ones who made up a mathe- 
matics class. Because this was not always the case, however, and 
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because certain sections did not meet the stated criteria, the two 
subjects, as well as the two grades, were treated separately. 

The ITBS reading comprehension test scores were used as the 
measure of the dependent variable for the English classes, and for 
the mathematics classes, the scores on the arithmetic test. Gain 
scores represent the diflFerence between the grade equivalent 
achieved in the seventh and the eighth grades. Classes were classi- 
fied as homogeneous or heterogeneous on the basis of the standard 

TABLE 1 

CUTTING POINTS USED TO CLASSIFY CLASS SECTIONS ON INITIAL 
ACHIEVEMENT LEVEL, VARIA73ILITY, AND SIZE 

(test data in grade EQUIVALENTS ON IOWA TESTS OF BASIC SKILLS) 



; 

;; ■ i 


Engusb Classes 


MATBEiuncs Classes 


i \ . CtASSinCATlOK 

•! ^ 


Grade 7 
(^7-48) 


Grade 8 
(A'- 82) 


Grade 7 
(A'-SS) 


Grade 8 
(A^-80) 


j j " 

Initial acliieveinent level 
i (section mean): 

I Above average 


7.851 and over 


8.900 and over 


7.551 and over 


8.300 and over 


1 Tielow average 


Below 7.851 


Below 8.900 


Below 7.551 


Below 8.300 


Class variability (standard 
, deviation): 

i Heterogeneous 


1.045 and over 


1.231 and over 


0.651 and over 


0.771 and over 


1 Homogeneous 


Below 1.045 


Below 1.231 


Below 0.651 


Below 0.771 


Class size (number of pu- 
: i pils): 

Large 


29 and over 


28'and over 


29 and over 


28 and over 


1 Small 

'1 f 


Below 29 


Below 28 


Below 29 


Below 28 



deviation of students* scores in the initial (seventh-grade) testing; 
they were classified by level as high or low on the basis of die mean 
initial scores. The cutting points in the triple dichotomization by 
size, variability, and level are given in Table 1. 



RESULTS 



The partitioning created eight categories of classes for each sub- 
ject in each grade, a total of thirty-two. The mean gain of each 
class was computed. The means of the mean gains for all classes in 
each of the categories are given in Table 2. The number of classes in 
each category is shown. 
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It is evident from Table 3 that the gain differences between large 
and small classes are generally very small and inconsistent. There 
are grade and subject differences as well as those among levels and 
variability. For both English and mathematics, however, seventh- 
grade gains were greatest for small heterogeneous classes, while in 



TABLE 3 

GAIN SCORE DIFFERENCES FOR TAAIN EFFECTS 



Subject and G^ase 


Class Size 


Small 


Large 


Difference 


English (7) 


0.77 

0.78 

1.11 

1.31 


0.69 

0.82 

1.00 

1.41 


-^b.08 

-0.04 

+0.11 

-0.10 


English (8) 

Math (7) 


Math (8) 


F.nglifiTi (7) 


Class VAtiABiuxy 


Homoge- 

neous 


Heteroge- 

neous 


Difference 


0.72 

0.80 

1.02 

1.39 


0.74 

0.80 

1.09 

1.34 


-0.02 

0.00 

-0.07 

+0.05 


English (8) 


Math (7) 


Math (8) 


FnpKfih (7) 


Itaxi/a Achievement Level 


Above 

Average 


Belov 

Aveiage 


DiUerence 


0.71 

0.78 

1.16 

1.62 


0.7S 

0.82 

0.95 

1.10 


-0.04 

-0.04 

+0.21 

+0.52 


En|ush(8) 

Math (7) 


Math (8) 





the eighth grade they were greatest for large homogeneous classes. 
In both grades, below-average classes showed greater gains than 
above-average classes on the reading test, while on the arithmetic 
test the gains were more in line with expectations. It is possible that 
there was a “topping” effect on the reading test or that schools 
tended to tiy more deliberately to improve pupils’ reading abilities 
in lower-ability classes. 
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The first-order inleraclion effects involving class size are indicated 
,-n TaHfi 4. In ten of the sixteen situations, gains were greatest for 



the larger groups. The simunary in Table 5 shows that large groups 
had the advantage seven out of eight iiimes for all eighth-grade 
dasses, seven out of ei^t times for below-average classes in both 
grades, six out of eight times for all malhemaUcs classes, and six 
out of ei^t times for heterogeneous dasses in both subjects. 



TABLE 4 

GAIN SCORE DIFEERENCES FOR FIRST-ORDER INTERACTION 
EFFECTS INVOLVING CLASS SIZE 





Gains 


Category 


Small 


Large 


Difference 


Above average homogeneous: 
ITnwIieli H'i 


0.82 




0.66 


+0.16 




0.82 




0.71 


+0.11 


Matli ^7^ 


1.35 




1.01 


+0.34 


Math (8) 


1.53 




1.76 


““0.23 


Above average heterogeneous: 
■Rmrlicli ^71 


0.89 




0.47 


+0.42 




0.76 




0.80 


—0.04 




1.43 




0.94 


+0.49 


Math (8) r - r 


1.52 




1.71 


—0.19 


Below average homogeneous: 

WnorlicTt ^71 a...... 


0.71 




0.67 


+0.04 


JIdllKUsu \ 9 /•••••••••••••••"•* 


0.80 




0.86 


-0.06 


IVfRth ^71 


0.77 




0.96 


-0.19 


Math (8) 


0.98 




1.19 


—0.21 


Below average heterogeneous: 


0.66 




0.67 


-0.01 




0.69 




0.87 


-0.18 


Mafli ^7^ 


1.03 




1.13 


-0.10 


Mafli (Si A « . 


1.04 




1.16 


—0.12 













Because so many interaction effects seemed to be present, a triple 
classification analysis of variance was carried out for both subjects in 
both grades. Since cell sizes were not equal, a table of random 
numbers was used to eliminate class sections whenever necessaiy 
to bring the number in each cell to five. A summary of the analysis 

is given in Table 6. 

It will be noted that only five effects proved to be statistically 
significant. None of the others was significant at even the .05 level. 
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TABLE 5 

SUMMARY OF RELATION OF CLASS 
SIZE ON ACHIEVEMENT GAINS 
FOR VARIOUS CATEGORIES 



Catsgories 


SiTUAnONS 

Favoizng 




SmaU 


Large 


Subject: 

English 


4 


4 


Mathematics 


2 


6 


Grade: 






7 


s 


3 


8 


1 


7 


Level: 






Above average. . . 


s 


3 


Below average. . . 
Variability: 


1 


7 






Homogeneous 


4 


4 


Heterogeneous. . . 


2 


6 



TABLE 6 

SUMMARY OF TRIPLE CLASSIFICATION ANALYSIS OF VARIANCE 



SoVXCB 



Class size 

Variability 

Achievement level 

SizeXvariabmty 

SkeXlevel 

VanabilityXlevel 

SizeX variability X level . 
Within 



Total* 



Class size 

Variability 

Achievement level 

SizeX variability 

SizeXlevel 

Variability X level 

SizeXvariabilityXlevel. 
Within 



Total. 



Ekgusb 



Grade 7 


Grade 8 


df 


Sum of 
Squares 


Mean 

Square 


F 


Sum of 
Squares 


1 Mean 
Square 


P 


. 1 
. 1 
. 1 
. 32 


8.99 

0.63 

1.35 

0.24 

66.72 

14.74 

14.77 

186.17 


8.99 

0.63 

1.35 

0.24 

66.72 

14.74 

14.77 

5.82 


1.54 

0.11 

0.23 

0.04 

11.46* 

2.53 

2.54 


8.58 

0.94 

4.92 

6.23 

3.13 

4.46 

0.43 

265.85 


8..'38 

0.94 

4.92 

6.23 

3.13 

4.46 

0.43 

6.65 


1.29 

0.14 

0.74 

0.94 

0.47 

0.67 

0.06 


. 39 


293.61 






294.54 














Matheiiatzcs 


1 

1 

1 

1 

1 

1 

1 

32 


15.90 

15.28 

88.21 

4.08 

138.01 

42.60 

2.07 

170.24 


15.90 

15.28 

88.21 

4.08 

138.01 

42.60 

2.07 

5.32 


2.99 

2.87 

16.58** 

0.77 

25.94 

8.01* 

0.39 


19.83 

0.56 

245.97 

8.35 

7.32 

2.64 

4.01 

310.59 


19.83 

0.56 

245.97 

8.35 

7.32 

2.64 

4.01 

9.97 


1.99 

0.06 

24.67** 

0.84 

0.73 

0.26 

0.40 


39 


476.39 






599.27 







* Significant at .01 level. 

•* Significant at .001 level. 
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\cliievement level is a signi£cant deteiminer of gains in mathemat- 
ics in both grades. Class size interacts with achievement level in 
both subjects, but only in the seventh grade. 

Although the classes studied varied in size from 10 to 36, two- 
thirds of them consisted of from 23 to 32 pupils. It may be argued 

table 7 

uedl\n achievement gains of the largest and smallest classes 



SoSJECT, GkADB, 
Tym 



Enj?Ush (7): 

Total 

Above average. . 
Below average, . 
Homogeneous. . . 
Heterogeneous. . 
English (8): 

Total 

Above average. . 
Below average. . 
Homo^neous. . 
Heterogeneous. . 
Math (7): 

Total 

Above average. 
Below average. 
Homogeneous. . 
Heterogeneous. 
Math (8): 

Total 

Above average. 
Below average. 
Homogeneous. . 
Heterogeneous. 





SUAU 




Lasgk 


N 


Med. 


Mied. 


V 


Med. 


Med. 


Size 


Gain 


XV 


Size 


Gain 


6 


18.5 


0.70 


11 


35.0 


0.53 


1 


24.0 


0.58 


6 


35.0 


0.49 


5 


18.0 


0.76 


5 


35.0 


0.80 


3 


19.0 


0.76 


4 


34.5 


0.62 


3 


18.0 


0.64 


7 


35.0 


0.45 


16 

c 


22.0 
oo n 


0.77 


8 

n 


35.0 


0.85 


5 

11 


*U 

22.0 


u.vo 

0.70 


8 


35.0 


6.85 


12 


22.0 


0.83 


1 


35.0 


0.85 


4 


22.0 


0.70 


7 


35.0 


0.85 


8 


19.5 


0.77 


13 


35.0 


0.94 


2 


24.0 


1.05 


7 


35.0 


0.85 


6 


18.5 


0.68 


6 


35.0 


1.02 


5 


18.0 


0.71 


4 


34.5 


0.97 


. 3 


24.0 


1.14 




35.0 


0.94 


. 11 


21.0 
oo n 


1.04 


9 

n 


35.0 


1.11 


3 

8 


22 eU 
21.0 


0.96 


u 

9 


35.0 


i.ii 


6 


21.5 


1.04 


1 


34.0 


1.39 


. 5 


21.0 


1.04 


8 


35.0 


1.08 



Gaik 

Diir. 



+0.17 

+0.09 

-0.04 

+0.14 

+0.19 

-0.08 



-0.15 

- 0.02 

-0.15 

-0.17 

+0.20 

-0.34 

-0.26 

+0.20 

-0.07 



-0.15 

-0.35 

-0.04 



Mank* 
Whitnev 
U 



25 (13)* 
1 ( 0 ) 
15(3) 

Ifi 

62 (31) 



35 (19) 
6 ( 0 ) 
16(3) 

31(5) 

45 (23) 



53(15) 

1 ( 0 ) 

22 ( 6 ) 



• Figure in parentheses is critical value of V at .05 level for two-tailed test. Probability of any larger U exceeds 
.05; hence none of differences shown is significant at that level. 



that a comparison of more extreme sizes would reveal a relationship 
tliat is obscured in the above analysis. To test this possibility, the 
largest and smallest classes were isolated for separate comparison. 
None of the largest classes contained fewer than 34 students, and 
none of die smallest contained more than 24. Their median sizes and 
median achievement gains are given in Table 7. 
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A Mann-Whitney t/-test® was performed on the distribution for 
each subject in both grades. The results confirmed the general find- 
ing of the study: There was no instance of a significant difference 
in achievement gain even between these extreme groups. The me- 
dians for subgroupings classified by initial achievement level and 
variability are reported merely to indicate how negligible the differ- 
ences were. In most instances, however, the number of cases in these 
categories is so small that it would be impossible for a significant 
difference to manifest itself. 

CONCLUSIONS AND DISCUSSION 

An examination of the achievement gains made by some 7,500 
seventh- and eighth-grade pupils in 265 English and mathematics 
classes reveals no consistent effect of class size on the gains. Of 
course these tests do not measure all of the types of achievement 
toward which instruction was directed, nor do they reveal other 
benefits which might accrue to students from being in small classes. 
But insofar as improvement on the Iowa Test of Basic Skills is con- 
cerned, it does not seem to make any difference how large the classes 
are, within the limits included in this sample. It should not be con- 
cluded, of course, that the same results would be obtained if class 
size were allowed to go as high as one hundred or two hundred.. 
Neither do the findings rule out the possibility that higher achieve- 
ment gains might, with different procedures, be attained by smaller 
classes. All that the present data indicate is that, among the classes 
studied, such superior attainment was not in evidence. 

Inasmuch as essentially the same data were used in this study as 
in Millman and Johnson's,^ again no consistent relation is revealed 
between class variability and achievement gains. Oddly, it is only 
for mathematics that even initial achievement level is consistently 
related to gains. 

Considering the small differences entailed and the interaction of 
numerous factors, it does not mean much to conclude from this study 
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that large heterogeneous mathematics sections should be advocated 
for below-average eighth-grade students. Nevertheless, the results 
suggest that uncritical worship of small classes for all subjects, 

grades, and ability levels is unjustified. 

It is important to distinguish between class size and teachers 
loads. The total number of contacts may be more significant than 
the number at any one time. Four classes of thirty may be prefer- 
able to five of twenty-four. The purpose of reducing class size must 
also be considered. Just because reading scores are not improved by 
reducing the sizes of English classes does not mean that writing 

ability will not thereby be improved. 

Secondary school teachers need more time for scholarly activities. 
If their schedules called for fewer but larger classes, some time 
might be gained for this purpose. It is also conceivable that in some 
instances generally larger classes and total loads, with appropriate 
additional remuneration, might both be economically advantageous 
^d relieve teacher shortages in critical fields. Experimentation con- 
ducted free of the prevailing bias against large classes is badly 
needed. The results of the present study tend to undermine one of 
the bases of the bias. 

It may be noted also that the results apparently accord with re- 
cent findings in Sweden. Torsten Hus4n» has reported a study by 
Sixten MarWund® that bore upon a proposal in the Parliament to 
“increase the educational productivity” of the country by “a succes- 
sive reduction in class size.” Hus4n states that “Marldund was able 
to make it quite clear that within the range under consideration 
there was no evidence to support such a prediction.” 
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